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Abstract. We study a non-Markovian random walk in dimension 1. It depends 
on two parameters cr, and cl, the probabilities to go straight on when walking to 
the right, respectively to the left. The position x of the walk after n steps and the 
number of reversals of direction k are used to estimate cr and £l- We calculate the 
joint probability distribution p n (x , k ) in closed form and show that, approximately, it 
belongs to the exponential family. 
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1. Introduction 

Consider a random walk starting in the origin x = 0 of the lattice Z. The probability 
that after n steps the walk is in x and changed its direction k times is denoted p n (x, k). 
This paper investigates the question how p n (x, k) depends on model parameters. We 
wonder whether it can be written into the form 

Pn(x, k) = D n (x, k) exp(G + /3k + Fx). (1) 

In this expression, G, (3 and F depend on model parameters. However, the prefactor 
D n {x , k) does not depend on model parameters. The function (3 has the interpretation 
of an inverse temperature (in dimensionless units), the function F is an external force, 
the function G , when divided by /?, is a free energy and serves to normalize (HD- a 
probability distribution p n (x, k) of the form (JTJ) is said to belong to the exponential 
family. It has nice properties. In particular, averages of x and k can be calculated by 
taking derivatives of G with respect to the parameters. 

Random walk models are omnipresent in statistical physics and have been studied 
extensively. Quite often results are obtained in the limit of large n. Here, the focus 
is on all n. Deviations from ©, found below, are neglegible in the large n-limit. 
Standard techniques aim at calculating correlation functions. It is rather seldom that 
exact expressions for probability distributions can be written down in closed form. In 
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the present model such closed form expression exists for p n (x, k ), but probably not for 
the marginals p' n (x) = ^2 k Pn(x, k) and p”{k) = ^2 x p n ( x ,k). Individual events have 
usualy such a small probability that they cannot be evaluated numerically. In addition, 
in situations with a large number of degrees of freedom, knowledge of p n (x,k) is not 
sufficient to evaluate moments of the distribution in closed form. However, if a closed 
form expression of p n (x, k) is available then analytic relations can be used to evaluate 
relevant quantities. 

The model, considered here, is that of a one-dimensional persistent random walk 
(see e.g. [T]) with drift. Many generalizations of the persistent random walk can be 
found in literature, e.g. for continuous time [2], or with a memory that goes back more 
than one step [3]. Planar persistent random walks have been studied in HE!- 

Our model is a toy model that helps to understand features of more realistic 
models used in several branches of physics. One such application, well-known since 
the pioneering work of Flory |)j, is the use of random walks to model the geometry of 
polymers. Persistent random walks play also a role in understanding the transition from 
ballistic to diffusive transport nnumEi and have been applied in financial physics, 
see e.g. [131. 

Recent technological progress has made it possible to do experiments on single 
molecules and to measure the elongation of a single polymer (x) as a function of applied 
force F (see e.g. 0BE!). The analysis of these experiments is based on the assumption 
that p' n (x) is proportional to exp(— /3V(x) + Fx ) for some potential V(x). This relation 
follows from CD with exp(— (3V{x)) = D n (x, k)ed k . The latter expression shows 
that V(x) is indeed a free energy, as claimed in [5]. An in depth discussion of these 
experiments based on the results of the present paper is found in j THf j. 

In the next section the model is introduced. In Section 3 average values for position 
x and number of reversals k are calculated using the method of generating functions. 
In Section 4 the number of walks ending in x after n steps, and having a given number 
of reversals k, is calculated. These counting results are used in Section 5 to write 
down the joint probability distribution p n (x,k). Section 6 considers the dependence 
of p n (x, k ) on the parameters 6 r and 6 l and tries to answer the question whether this 
two-parameter probability distribution function (pdf) belongs to the exponential family. 
Section 7 shows how to calculate averages starting from the knowledge that the pdf is 
exponential. The final section gives a short discussion of the results. 

2. Model 

Consider a discrete-time random walk on the one-dimensional lattice Z. The probability 
of the walk to step to the right (i.e., with increasing position) equals 6 r when coming 
from the left and 1 — 6 l when coming from the right. This is not a Markov chain since 
the walk remembers the direction it comes from. Let x n be the position of the walk 
after n steps. Let a n = x n — x n _i be the direction of the n-th step. Then x n+ \ = x n + 1 
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with probability 

-(1 + (T n )eR +-(1 - cr n )(l - e L ), (2) 

x n+ i = x n — 1 otherwise. The process of the increments a n is a two-state Markov chain 
with transition matrix 


P 


CR 1 — Cr 

1 — £l e L 


(3) 


In the stationary state a n equals ±1 with probability given by 


(o) 

P+ = 


cl 


(o) 

p_ = 




9 r j-- 9 - (4) 

2 — — 6l 2 — 6r — 6l 

Let k n denote the number of reversals of the walk after n steps. By definition a 

reversal occurs at step n if a n -\a n = —1. Hence one has 


x n — ^ <Jj 

3 = 1 
1 n 

k n = -^( 1 -^- 1 ^). 
3 =1 


(5) 

( 6 ) 


The quantity of interest in this paper is the joint probability of position x n and number 
of reversals k n . The obvious initial conditions are Xq = 0 and cr 0 = ±1 with probability 



The physical interpretation of the model is twofold. The random walk is a simple 
model of a polymer with n units. Energy is proportional to minus the number of reversals 
k n . The position of the end point x n measures the effect of an external force applied to 
the end point. Alternatively, k n is the number of domains (decreased by 1 if (j\ — <7 0 ) of 
an Ising chain, and x n is the total magnetization. Indeed, the variables cr n describe Ising 
spins on a one-dimensional lattice. A domain is then a set of subsequent sites where the 
spins all have the same value, either up (+1) or down (-1). The boundary between two 
domains involves a reversal {(jj-\Oj = —1). 


3. Generating functions 

Let p ± (n,x,k ) denote the probability that a n = ±1, x n = x, and k n = k. The joint 
proability distribution, searched for, is then 

Pn(x,k) =p+(x,k) +p-(x,k). (7) 

The following recursion relations hold 

Pn(x, k) = e R - 1, k) + (1 - ei^p^-iix -l,k-l) (8) 

Pn(x, k) = (1 - e R )p^ 1 (x + l,k-l) + e L p+_!(a; + 1, k). (9) 

Introduce generating functions 

n n 

f±\w,z) = wX ^2 zk Pn( X ^ k )- 

k =0 


x=—n 


( 10 ) 
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and a similar expression for f( n \w,z). They satisfy 


f+(n) 
f -( n ) 


= M(w, z ) 


f+( n ~ 1) 


( 11 ) 


with 


M(w, z) = 


( 12 ) 


e R w (1 - e L )wz 

(1 - e K )z/w e L /w 

It is possible to calculate the n-th power of this matrix by first diagonalizing it. The 
result is 


M n (w,z) = -{Xl + Xl) 


1 0 
0 1 


with 


and 


+ ^(A" + - A") 


A± = -(e R w + e L /w ± v) 


we R -e L /w 2(1 - e L )wz 
2(1 - e R )z/w -we R + e L /w ) ’ 


(13) 


(14) 


v = \] (e R w - e L /w) 2 + 4(1 - e R )(l - e L )^ 2 . (15) 

Let us now consider initial values. Note that f±\w,z) is not yet defined because 
ko involves <r_i, which is undetermined. Starting point is therefore f±\w,z ), which is 
found to be given by 


Hence, it is obvious to define 


The generating function f^ n \w,z) is now explicitly known 

_ 1 


(w,z) I 

(vf 
V p (0) 

p+ \ 
(0) 

P- ) 



(16) 


(17) 


/ (n) K z) = -(A” + A”) + -(X n + - A”) [(e R w - e L /w)(pf - p W ) 


+ 2(1 - e R )p ( ^z/w + 2(1 - 6i)p [ L , wz 


(o)„ 


It can be used to calculate expectation values by taking derivatives. For example, 

W=Z=1 

(1 — c R )(l — c L ) 


(18) 


(k n ) = 


dz 
= 2 n 


and 


{x n ) = 


d_ 

dw 


2 — e R — Cl 


f (n \w,z) 


(19) 


= n- 


w=z=l 

e R — cl 
2 — e R — cl 


( 20 ) 
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The present section is temporarily limited to the special case 6 r = cl = 1/2. From the 
next section on the general model will be considered again. Indeed, we first determine 
the number of walks c±(n,x,s) which, starting in the origin in direction ±1, end in x 
after n steps and have s segments. The result does not depend on the value of cr and 
6 r. Hence the calculation can be done in the simplest case. In the next section the 
result will be used to calculate the joint probability distribution p n (x, k ) for the general 
model. 

Divide the walk into segments of constant aj. Number these segments from 1 to 
s n . Note that s n — k n + 1 if G\ — cr 0 , s n = k n otherwise. This means that the number 
of segments equals 1 plus the number of reversals, not counting the initial reversal at 
x = 0, if present. Let t 3 denote the length of the j'-th segment. The probability that 
segment j has length l equals 2~ l . The probability of counting s segments in a walk of 
n steps satisfies 

S — 1 S 

V(s n = s)=v(^Tj<n< 

3=1 3=1 

oo oo 

— y ^ • • • y ^ 2+ • • •+ i s ~i < n < /i + • • ■+/ s } 

h =l ls =l 

n—1 oo oo oo 

= E 2 ~” E ■" E w + - + i.-, E 2 "'' 

m=s —1 / i=l l s _i = l l s =n—m 

n— 1 oo oo 

= E E■ ■■ E W+- 

m=s— lii=l l s -i=l 

= 2 -(n- 1 )^-^ ( 21 ) 

Hence the conditional probability given a certain number of segments equals 

V(ti = /i,---,r s _i = 1 S - I \s n = s) = ^ . (22) 

This means that the variables Tj, after conditioning on a given number of segments, 
become uniformely distributed. This observation simplifies the following calculation. 

The position x n of the walk after n steps, assuming s n segments, can be expressed 
into the segment lengths as 

Sn 1 

x n = <7i (n - t 2 4-± T Sn _ 1 T {n - do) • ( 23 ) 

3=1 

The ±-sign depends on whether the number of segments s n is even or odd and equals 
(—l) Sn . One obtains 

x n = a i ^2(ri + r 3 H-b r Sn _i) - nj if s n is even 

= o’! (n - 2(r 2 + r 4 H-b r Sn _i) j if s n is odd. (24) 
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For simplicity let us first consider the case of an even number of segments. Let 
s > 0 be even. Then one has 


V{x n = x,s n = s ) 

= V(s n = s)V(t i + r 3 -h r,s_i = (n + a 1 x)/2 \ s n = s) 

oo oo s —1 

= l i <n,h + h + --- + Is- 1 = (■n + dx)/2} 

h=l Is- 1=1 i=l 

OO OO OO 

= 2 -(»-) EE ... E 

^{h + ^3 + ''' + h -1 — ( n + c r i3 ; )/2} 

il = l ^3 = 1 ia-l=l 

OO OO oo 

I{^2 + ^4 + * * * + l s —2 < (n — (Jix)/2} 

^2 — 1 U = 1 Z s —2=1 

= 2 - (n - 1} 0 ^ (x, 8 ) 0(71 + X - 2 , s - 2)0(77 - x - 2 , s - 2 ) ( 25 ) 


with 


0(71, 777 ) = 


77 ! 


(26) 


777!! ((77 — 777)!! 

and with 0±(x, s) equal 1 if there exists a walk of n steps, starting in the origin in 
direction ±1, ending in x, and containing s segments, and zero otherwise. 

If 8 is odd, 8 > 3, then one has 


V(x n = x,s n = s ) 

= V(s n = s)V(t 2 + 7-4-b T s - 1 = (77 - <7\x)/2 I 8 „ = s) 

00 00 

= 2- (n - 1) E ■" E 

(1=1 q_i=i 

5—1 

x ^ lj < 77, ^2 T ^4 T ■ ■ ■ T Z S _1 = (77 — ( 7 ix)/ 2 } 
i=i 

00 00 00 

= 2- < "- ,, EE- e 

il = l /3 = 1 (s —2 = 1 

X I{/i + ^3 + ’ • • + / s _2 < (77 + CTiX)/ 2 } 

00 00 00 

xEE-E I{^2 + I 4 + ' ' • + Is-1 — { n — C r iX)/2} 

^2 — 1 ^4 = 1 / s _ 1 = 1 

= 2 _< ' n_1 ^0(( 1 (a:, s)0(?7 + cyx — 2, 8 — 1)0(77 — <7ix — 2, s — 3). 

(27) 


Note that, in case of an odd number of segments, the number of walks ending in x n 
depends on whether the walk starts to the left or to the right. 

Finally, if s = 1 then there is clearly only one walk ending in the point x n . 


5. The joint probability distribution 

Let us return to the general case with arbitrary and cl- The probability of a given 
77 -step walk depends only on er 0 , and on the final values x n and k n . To see this, note 
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that a segment of length r has probability (1 — if the direction is positive, 

and (1 — e^e^ -1 if the direction is negative. A factor 6 r can be associated with every 
step to the right, and 6 l with every step to the left. But then a factor (1 — 6 l)/6r, 
respectively (1—CR)/eL, must be associated with every reversal of direction from leftgoing 
to rightgoing, respectively rightgoing to leftgoing. 

The number of steps to the right respectively to the left is (n + x n )/2, respectively 
{n — x n )/2. The number of reversals from leftgoing to right going is denoted k~, from 
rightgoing to leftgoing They depend on whether the number of reversals is even or 
odd. If k n is odd then 

= (kn ± <to)/2. (28) 


Obviously is k~ + k+ = k n and k+ — k~ — (To- On the other hand, if k n is even then the 
number of reversals is k~ = = k n / 2, independent of the direction cr 0 . In both cases, 

the probability of the n-step walk, given that it ends in x, has k + reversals when going 
right and k~ when going left, is 


7n(®> k + ,k~ 


= e 


n-\-x 

R 2 « 


1 ~ €R 


fc+ 


1 — 

£r 


(29) 


The number of such walks is denoted D°° (x, k + ,k ) and equals 

D°°(x, k + , k~) = ®n°+i( x + °b, k + + k~ + 1 )E a °(k + , k~) 
x C(n — 1 + x + Co, 2 k~ + ao — 1) 
x C(n — 1 — x — no, 2 k + — (Jo — 1). 

(30) 


The function S cro (fc + , k ) equals 1 if k + = k or k + — k = Jo and zero otherwise. To 
see from where PI) follows, consider a walk of n + 1 steps starting at position — <r 0 and 
apply the results of the previous section. Note that the number of segments of this walk 
is k + + k~ + 1. 


The final result for the joint probability distribution p n (x, k ) is then 
Pn(x, k) =p+ ] q+(x,k) +p W q~(x, k ) 
with the probability distribution qpx,k) given by 


(31) 


q±(x,k) = 


= Dt 


k k 


x >2> 2 


x , 


In 

k± 1 k -F 1 


k k 
X ’ 2’2 


In 


X, 


k even 
k± 1 k -F 1' 


k odd. 


(32) 


As an example let us calculate 

p 4 (2, 3) = p^D+i 2, 2,1)74(2, 2,1) + p^D7( 2,1, 2) 74 (2,1, 2). (33) 


One has 


Dt( 2,2,1) = 9+(3,4)S + (2,1)C(6, 2)C(0, 2) 
£>*(2,1,2) = e 5 -(l,4)H-(l,2)C(4,2)C(2,2). 


(34) 

(35) 
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74(2,1,2) = e R (l-e R )(l-e L ) 2 


(36) 


one obtains 



(37) 


This example shows that sometimes only one of the two terms contributing to the 
r.h.s. of (Cm does not vanish. 

6. Exponential family 

One can write 



(38) 


with 



(39) 


0 = - bn ---- 

2 (1 — 6 r )(1 — £ l ) 

_ n , 


(40) 


(41) 


G = -lne R e L . 


This allows us to write q±(x, k), appearing in our main result (HTTl) . as 



(42) 


where 



(43) 


and with A = 1 if k is odd, and zero if k is even. Hence the probability 
distributions q±(x,k) belong to the exponential family, however not with two but 
with three parameters f3, F, and 7 . The third parameter 7 controls boundary effects. 
Hence, p n (x,k ) is a superposition of two distributions q±(x,k), both belonging to the 
exponential family. However, the domains on which these two pdfs differ from zero are 
not identical. 

If n is large then the variable A can usually be neglected, being small compared to 
typical values of k and x. One obtains the approximate result that, for those values of 
x and k for which p n (x, k) 7 ^ 0 , 



(44) 


This shows that p n (x, k) approximately belongs to the exponential family with two 
parameters [3 and F. Deviations between l.h.s. and r.h.s. of (HI occur for two reasons: 
there is a subtle difference in expressions for even k and for odd k, and there is a small 
dependence on the initial condition cr 0 . 
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Let us now see what exponential expressions are good for. 
approximate expression (HU). From kPn( x ^) = 1 follows 

First consider the 

f) AC 1 

°^I2gp Pn{X ’ k)= W +{kn) 

n 

(45) 

0 - S *0 = -Qp + M- 

n 

(46) 

Using (HT1) there follows 


n 1 . 1 , . 

0 =- 7 - r(k n ) H- ( x n) 

2cr 2cr(1 — Cr) 2cr 

(47) 

n 1 1 

0 =-7- r(k n ) -(x„). 

2 e L 2 e L (l — e L ) ' ' 2 e L X n/ 

(48) 


When solving these equations for (k n ) and (x n ) one recovers (ITU HH) . Hence, from 
the approximate result ( 1111 ) . which one can guess whithout hard work, one obtains 
immediately exact results for the averages ( k n } and (x n ). 

Let us now try to do the same starting from the exact expressions pit |121). From 
the normalization of q±(x, k ) follows the set of equations 

, „ ^ , ,.u . (A)± 


0 = 
0 = 


2cr 

n i 

2cl 2cl(1 — cl) 
They can be written as 

,± __ cr — £l 




(fcn) ± - — {Xn^ =F 


1 — 2e T 


(x n ) = n - 
(/c n ) ± = 2 n 


cr — £l 


T 2 - 


2e L 


cr — Cl 


2cl(1 — cl) 


(A) = 


(1 - cr)(1 - cl) 


T 


-(A) ± 

cr — cl 
cr — cl 


(A) : 


(49) 

(50) 

(51) 

(52) 


2 — Cr — Cl 2 — 6r — Cl 
For large n, the effect of the terms in A is small, as can be seen from these equations. 
Comparison with caisi gives 



(53) 


Hence one can write 

<A} + = 2 ~ fR ~! L <A) (54) 

2(1 - e L ) 

< 55 > 

It is however not clear how to obtain a closed form expression for (A), which is the 
probability that k n is odd. 

The averages (EUE2) are calculated with boundary conditions oq = +1 or cr 0 = — 1. 
The expressions are the sum of a part independent of the boundary condition and a 
small contribution which depends on the boundary condition and on the probability 
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Figure 1. Average position (x), divide by n , as a function of external force F for two 
different values of /3; based on Equations mmm- 


(A) that the number of reversals k n is odd. An annoying consequence of the fact that 
the probabilities k) belong to the exponential family with three parameters instead 
of two is that the averages of x n and k n cannot be obtained from (E21l by simple taking 
of derivatives. Even when the results of Section 3 are invoqued, not all quantities can 
be determined. In particular the probability (A) cannot be obtained in this way. 

Simple random walk corresponds with the choice 6 r = 6l = 1/2. This implies 
infinite temperature (i.e. vanishing (5) and absence of drift (F — 0). The third parameter 
7 vanishes as well. Also random walk with drift is a special case, corresponding with 
6 r -\- 6 l = 1. Again, (3 = 0 and 7 = 0 follow. A persistent random walk is obtained when 
6 r = £ l - This implies F — 0, but non-vanishing f3 and 7. 

8. Discussion 

We have studied a simple model of random walk depending on two parameters 6 r and 
cl - The parameters are estimated using the position x n of the walk after n steps, and 
the number of reversals of direction k n . The technique of generating functions is used to 
calculate averages (x n ) and (k n ). Next, explicit expressions are obtained for the number 
of walks that end in the same position x n and have the same number of reversals k n . 
These counts are used to write an explicit result m for the joint probability distribution 
p n (x, k). I 11 the final part of the paper we try to write this joint probability distribution 
in the form of an exponential family. This succeeds only in an approximate manner. 
The distribution p n (x, k) is a superposition of two pdfs q n (x, k), both belonging to the 
exponential family, but with three parameters instead of two. The third parameter 
controls the probability that the number of reversals is odd. The difference between 
walks with even or odd number of reversals vanishes in the limit of large n. 

Some of the main results of the paper are explicit expressions m EH for 
thermodynamic parameters (3 and F in terms of the model parameters €r and e\ j . They 
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are used in Figure 1 to plot average position as a function of external force F. The latter 
quantity can be measured experimentally. Our result shows a typical sigmoidal curve, 
in qualitative agreement with the measurements of [Zj. A more profound discussion of 
the implications of the present work is found in .HI. 

The calculation of the probability (A) that the number of reversals is odd is an 
open problem. Also did we not succeed to obtain a closed expression for the marginal 
distribution p' n (x) = ^2 k p n ( x , k) of the position of the walker. Note that the result 
of Section 4, counting walks with given x and k, is not needed in later sections to 
derive expressions for average position and average number of reversals. This is a 
positive consequence of knowing that the parameter dependence of the probability 
distributions of (1H1) is exponential. There is good hope to find distributions belonging 
to the exponential family also in more general models, because exact relations can be 
derived, even in cases where counting walks would raise an unsurmountable problem. 

Deviations from exponential distribution, as in expression (1441) . are due to memory 
effects. The walker remembers initial conditions, even if these are carefully chosen. 
Reason here is that the process is non-markovian. Of course, these effects are negligible 
when the number of steps n is large. In many realistic models long range interactions 
produce memory effects which remain important for large n. E.g., in polymers the 
excluded volume effect causes long range interactions. Such models are less suited for 
rigorous analysis. We expect that deviations from exponential dependence, found for 
finite n in the present model, will occur in models with long range interactions, even in 
the limit of large system size. 
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